Model Selection

Multimodal adaptation

# Multimodal adaptation

Webssl Dino7b Full8b 378

A 7-billion-parameter vision Transformer model trained on 8 billion language-unlabeled web images, achieving exceptional visual representation capabilities through self-supervised learning

Image Classification

Tiny Random Phi 4 Multimodal

This is a tiny model for debugging, randomly initialized based on the adjusted configuration, specifically designed for rapid process verification.

Aimv2 1b Patch14 224.apple Pt

AIM-v2 is an image encoder model based on the timm library, with a scale of 1 billion parameters, suitable for image feature extraction tasks.

Image Classification

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase